Best Available Copy 

JEuropaisches Patentamt 
European Patent Office ® Publication number: 0 1 63 406 

Office europeen des brevets ° ^ 

© EUROPEAN PATENT SPECIFICATION 

® Dateofpublicationofpatentspecification:09.01.91 ® Int. CI. 5 : C 12 N 15/62, C 12 N 15/67, 
w C 12 P 19/34, C 12 N 1/20, 

® Application number: 85302744.9 C 12 P 21/02, C 07 K 7/40 

© Date of filing: 18.04.85 



® Novel polypeptide expression method. 



CO 

10 
O 



CL 
LU 



(3) Priority: 01.05.84 CA 453270 



(§) Date of publication of application: 
04.12.85 Bulletin 85/49 



Publication of the grant of the patent: 
09.01.91 Bulletin 91/02 



Designated Contracting States: 
BECHDEFRGBITLINLSE 



References cited: 
EP-A-154576 
EP-A-0 040466 
US-A-4342 832 
US-A-4366 246 



® Proprietor: Canadian Patents and Development 
Limited Soctete Canadlenne des Brevets et 
d'Explortation Limitee 
275 Slater Street 
Ottawa Ontario, K1 A 0R3 (CA) 

® Inventor: Garvin, Robert T. 
33 DeForest 

Toronto Ontario M6S 1J1 (CA) 
Inventor: Shen, Shi-Hsiang 
15 Rockford Drive Apt. 1 105 
Willowdale Ontario M2R 3A3 (CA) 
Inventor: James, Eric 
3303 Don Mills Road Apt. 2903 
Willowdale Ontario M2J 3A3 (CA) 

@ Representative: Burford, Anthony Frederick 
etal 

W.H. Beck, Greener & Co. 7 Stone Buildings 

Lincoln's Inn 

London WC2A3SZ(GB) 



Note: Within nine months from the publication of the mention of the grant of the European patent, any person may 
give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall 
be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been 
paid (Art. 99(1 ) European patent convention). 

Courier Press, Leamington Spa, England. 



EP 0163 406 B1 

Description 

Bacteria, in common with mammalian and other eucaryotic cells, possess a proteolytic system for the 
selective degradation of abnormal or no longer needed polypeptides. In the natural course of events, 
5 abnormalities may arise by either mutation or biosynthetic error. Alternatively, a normal polypeptide may 
be needed in the ceil only at a specific time to carry out a specific vital function while, at any other time, the 
presence of that polypeptide is deleterious to the cell, so that, once the polypeptide has carried out its 
function, it is selectively destroyed. These operations imply a complex network of interacting regulatory 
and structural elements, and, indeed, in Escherichia co/i, the best studied bacterial system, eight different 
70 endoproteases have so far been described which function, individually or in concert, in this selective 
degradation system. . 

The introduction of foreign genes into a microorganism often has inadvertently activated this 
proteolytic system, thereby inhibiting the large scale production of a desired gene product by the practice 
of genetic engineering. This problem is particularly acute when genes coding for eucaryotic peptides are 
75 introduced into the bacteria. The strategy for circumventing this problem has been to fuse the nucleotide 
sequence coding for the desired gene product to at least part of a structural gene, or carrier protein gene, 
whose product is known to be constitutively stable. 

The tandem arrangement of desired gene product and carrier protein results in the expression from the 
organism of a hybrid polypeptide which is stable, provided that the carrier protein is large enough to 
20 identify the entire hybrid polypeptide as both native and constitutively stable. The prior art difficulty of 
direct expression of eucaryotic polypeptides and the necessity to use a carrier protein to achieve 
expression is well recognized and has been described in a number of scientific papers and published patent 
applications and patents, for example, in U.S. Patent No. 4,342,832. A typical patent which describes the 
use of this technique, for the production of somatostatin, is U.S. Patent No. 4,366,246. 
25 A disadvantage to this approach, however, is that the desired product constitutes only a small portion 
of the hybrid polypeptide, thereby decreasing significantly the theoretical yield, and necessitating further 
processing to split off the native carrier protein and to purify the desired polypeptide product. Efforts to 
decrease the native carrier moiety to a small portion of the hybrid product in an attempt to improve the 
yield, however, render the expressed hybrid polypeptide unstable. 
30 The symbols, abbreviations and some of the terms used herein have the following meanings: 
DNA — deoxyribonucleic acid 
A— Adenine 
T— Thymine 
G — Guanine 
35 C— Cytosine 

Tris— 2-Amino-2-hydroxy-ethyM ,3-propane diol 
EDTA— Ethylene diamine tetraacetic acid 
ATP— Adenosine triphosphate 
TTP— Thymidine triphosphate 
40 GTP— -Guanosine triphosphate 

T4^-Bacteriophage specific to £ coli 
Nucleotide— Nucleoside phosphate 

Codoi>— Three nucleotide sequences specifying a particular amino acid 
Oligonucleotide— Polymer consisting of a number of nucleotides 
45 Piasmid— Self-replicating, extrachromosomal genetic element 

In accordance with the present invention, it has now surprisingly been found that this prior art problem 
can be satisfactorily overcome and the natural, selective proteolytic mechanisms of the microorganisms 
can be circumvented to achieve high yield production of a desired eucaryotic polypeptide. This is achieved 
by tandemly-Iinking a plurality of polypeptide genes through selectively cleavable amino acid sequences, 
so incorporating the resulting DNA construct into a cloning vehicle, and transforming a microorganism with 

the cloning vehicle. ^ klA ^ , ^ t 

in accordance with the present invention, therefore, there is provided a DNA fragment composed of 
repeating units coding for a desired product of known amino acid sequence tandemly-linked by 
oligonucleotides coding for a selectively cleavable amino acid sequence and having no codons for 

55 operative combinations of stop and start signals between the ends of said fragment, whereby the fragment 
is expressible in a suitable vehicle to produce a polypeptide composed of tandemly-linked repeating units 
of the said desired product. . 

By providing a DNA fragment composed of repeating units coding for a desired product of known 
amino acid sequence and separated by oligonucleotides coding for selectively cleavable amino acid units, 

6o a replicabie cloning vehicle can be provided which is capable of stably expressing a product in a microbial 
organism. Once the product has been expressed during growth of the microorganism, it may be isolated 
from the cell culture, and the desired polypeptide product is obtained from the isolated product by cleaving 
the repeating units at the cleaving sites to form a plurality of individual units. Since multiple copies of the 
desired polypeptide product are expressed from the microorganism, the yield of product is considerably 

65 enhanced with respect to the prior art expression procedure discussed above. 
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The oresent invention has general applicability to the enhancement of stability of Pp^peptides «n 

SSThormone. somatostatin and ^!»»*™' » ^^^ZSSSLilm units 
e Essential to the present invention is the formation otaUiMATragmemwniu. . . . ?t whnq0 

depend on the desired product evnrp^ion Droducts refers to a material that has 

,nVe FS n re 2 is a derivation and restriction site map of plasmid pta*. 239-3PI; 

Sre6^^ 
2S '^CTeTfeTderivation and restriction site map of plasmid ****** 

30 with the preferred embodiment hereof, a DMA ^2i£lS?n£T32 for seleciively-cleavable 
coding for human proinsulin JiS?^ 

units, which, in the final product, permit ^^eparabon of the mar ™ ? , are absent from the 

cloning vshld., in both a fused and an unhand ' »" B S»at.on, » illuM n» »d in ^ ^ 
eonlioumtion. the human proinaulm coding sn» I U JjJ J^LctosMnse. In the unfused 

configuration, as compared with the i ""^"^J™^ is too unsta ble to be isolated as a product 
In both configurations, the single gene human , utneBt are added to the genetic 
from the microorganism but as additional human P^sul n s eque in ^ 
construct by the method of Figure 1. the ^neSn «lomeine in the unfused 

stable, approaching normal stab.lrty at the fused configuration. As increasing 

configuration, and at two tandemly-lmked l p £^^*™"d? Maximum number of coding sequences is 
numbers of proinsulin coding sequences are tandemly Imked, the numbe ? of tandemly- 
reached. beyond which the yield of polymer dechnes f ^^'^'"^ ? n a decreased amount of 
linked units in the unfused configuration ^^'.^SS^fciS 5TproIf»ulln coding wenca. In 
polymeric product being produced by the cell his P?f"™™ u ™? n the £ sed configuration. 

the «KSE»^^ in *■ embodiment i,,ustrated in 

55 Figure 1 is represented as follows: 

Proinsulin gene-Arg Arg Asn Ser Met-Proinsuiin gene 
AGG CGC AATTCT ATG 

linking units and modes of cleaving which may be adopted. cleavage may be used 
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junction, while for those polypeptides which do contain the methionine group, the design of a particular 
amino acid sequence positioned between desired polypeptide segments to provide an easily-cleavable site 
may be effected by genetic engineering methods. For example, polypeptides which do not contain an 
aspartyl-proHne bond may be engineered to contain such a bond between each unit of the mu ti-cooing 
5 sequence construct, so that recovery of the native material from the isolated multi-domain polypeptide 
may be achieved foliowingxleavage by acids, as described by Jauregui-Adelt and Marti, Anal. Biochem. 69, 

468 ln 9 general, enzymatic methods or chemical reactions which are amino acid residue-sequence specific 
may be used to effect cleavage of linking groups between the desired polypeptide genes. 
w The authentic human proinsulin molecule contains no methionine residues, so that the cyanogen 
bromide treatment cleaves a multi-domain proinsulin molecule into several proinsulin analog moieties 
along with the C-terminal authentic proinsulin unit. This results from the designed placement of 
methionine codons at the 5'-end of each of the proinsulin coding sequence units. 

The individual proinsulin units derived in this manner may be further processed into authentic human 
15 insulin by digestion with trypsin and carboxypeptidase B, or by any other convenient procedure. 

The formation of a recombinant plasmid containing an oligonucleotide sequence coding for multiple 
copies of human proinsulin leads to a considerable increase in yield of proinsulin and proinsulin analogs, 
■ when compared with the expression in the prior art of the hybrid molecule containing the p-galactosidase 

gene. , 
20 As seen in Figure 1, the preferred sequence to form the novel plasmids p7*ac/2PI and plac 239/2P1, and 
other multiple PI unit plasmids especially the expression plasmids p7ac/5Pl and plac 239/3PI which is 
further engineered as illustrated in Figure 8 to form the preferred expression plasmid plac-3P1, starts irom 
the plasmid pAT/PIR which contains an oligonucleotide sequence between the EcoRI and BamHI restriction 
sites coding for human proinsulin. The derivation from the known plasmids pBCA4 of the plasmid pAT/PIR, 
25 which is also a novel plasmid, is illustrated in Figure 4. The pTac and plac 239 fragments are denved from 
plasmids p7*ac/Pl and plac 239/Pl respectively, which in turn are both derived from another novel plasmid, 
plac 504/PI, as shown in Figure 6. The derivation of plasmid plac 504/Pl from known plasmids is also shown 

in Figure 6. _ , . /B1 . 

It is not essential to obtain the pTac and plac vectors from the plasmids pTac/PI and plac/PI i.e. 

30 plasmids containing a single copy of the proinsulin gene. They may be derived from any other convenient 
plasmid which contains the Tac and lac promoter respectively, and which can be cleaved at EcoRI and Bam 
HI sites to form the vector for ligation with the multiple proinsulin gene fragment. Similarly, it is not 
essential to form the plasmid plac 504/PI, i.e. also containing the single copy of the proinsulin gene, as an 
intermediate, but any other convenient intermediate plasmid may be employed. The particular single copy 

35 proinsulin gene plasmids here embodied were formed by the inventors during the experimentation which 
led to this invention in an attempt to provide constructs from which proinsulin would be stably expressed. 
As noted earlier, this was not found to be possible with the single copy gene. 

Specific experimental details for the formation of the various plasmids set forth in the drawings and 
disclosed above are discussed in the Examples below. 

40 Detailed derivations and major restriction site maps for the novel plasmids plac 239-3PI, p/ac/brM, pAi/ 
PIR and plac 504/PI appear as Figures 2, 3, 5 and 7 respectively. E. colt JM103 modified by eacrrof the 
plasmids plac 239-3PI and p7ac/5PI, have been deposited on July 6, 1984 with the American Type Culture 
Collection located at Rockvilie, Maryland, U.S.A., and have been accorded the following accession Nos.: 

45 Organism £ co//7plasmid plac 239-3PI— ATCC 39760 
Organism £ co//7piasmid p7ac/5Pt-ATCC 39759. 
The invention is illustrated further by the following Examples. 

^ TOs Example describes the formation of novel plasmids containing multiple units of the human 
proinsulin gene. 

r Wn an author of that Daper of Cornell University, Ithaca, N.Y., u.b.A. 

The DlasmW pBCAA was cut with Fokl and BamHI and the Fokl-BamHI fragment «^ -solated^ Two 
svntletic o\ SKtides were created and ligated with the Foki-BamHl fragment to extend he 5 -end of 
Se fl sequel f om Fokl to EcoRI, the altered EcoRI-ended PI sequences bemg des.gnated PIR. The 
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synthetic nucleotide sequences were generated chemically and comprised the sequences: 

5' AATTCTATGTTTGTCAAT 3' 

3' GATACAAACAGTTAGTCG 5' 

The starting piasmid pAT153 is a high copy variant of the well-known plasmid pBR322 with the so- 
called "poison" sequences removed and is described by Twigg et a!, Nature, 283, 216 (1980), The plasmid 
pAT153 was obtained from Professor 0. Smithies, Dept. of Genetics, University of Wisconsin— Madison, 
Madison, Wisconsin, U.SA The piasmid pAT153 was cut with EcoRi and BamHI and the vector isolated. 
The isolated vector was ligated to the EcoRI-BamHI (PIR) fragment, to form the new piasmid pAT/PIR. 

(b) Preparation of multiple copies of proinsulin gene 

The sequence of operations required to form multiple copies of the proinsulin gene is outlined in 
Figure 1. Synthetic oligonucleotides (A) and (B) were prepared chemically to provide the following 
sequences: 

(A) 5' CCTCT ACC AG CTG G AG AACTACTG C AAC AG G CGC 3' 

(B) 3' ATG GTCG ACCTCTTG ATG ACGTTGTCCGCGTTAA 5' 

200 p.moles of oligonucleotide B and 200 p.moles of oligonucleotide A having its 5'-end labelled with 
32 P-ATP, were mixed in 30 ui of 66 mM Tris-HCI (pH 7.6), 6.6 mM MgCI 2 and 10 mM dithiothreitol, and 
heated at 80°C for 15 minutes to anneal the oligonucleotides to form a part of the human proinsulin coding 
sequence beginning at the SfaNI cut site near the 3'-end of the coding sequence and terminating with the 
last proinsulin codon (AAC), followed by the additional sequences: 

5' AGG CGC 3' 
3' TCCGCGTTAA 5' 

These additional sequences of the synthetic oligonucleotides encode two arginine residues and 
provides a sticky end for ligation to an EcoRI sticky end. The ligation destroys the EcoRI site because of the 
CG base pair (boxed in Figure 1). 

10 ug of pAT/PIR, prepared as described in (a) above, was digssted for 1 hour at 37°C with 40 units each 
of EcoRI and BamHI in 40 ul of digestion buffer. The digested DNA was dephosphorylated by digestion at 
37°C for 30 minutes with 4 units of calf intestine alkaline phosphatase in 200 mM Tris-HCI, pH 8.0. The 
proinsulin gene fragment (PIR) was isolated from 1% agarose gel by phenol extraction. The isolated 
fragment was further digested for 3 hours at 37°C with 6 units of SfaNI in 40 ul of digestion buffer and the 
EcoRI-SfaNI fragment of PIR was isolated. . 

100 p.moles of the annealed oligonucleotides (A) and (B) and approximately 4 p.moles of the isolated 
PIR EcoRI-SfaNI fragment were ligated for 14 hours at 15°C in 30 ul of 66 mM Tris-HCI (pH 7.6), 6.6 mM 
MgCI 2 , 10 mM dithiothreitol and 1 mM ATP using 20 units of T4 DNA ligase. As a result of 
dephosphorlyation of the proinsulin gene fragment prior to SfaNI cleavage, and phosphorlyation of only 
oligonucleotide (A), as described above, the reaction was unidirectional, resulting in ligation of the 
proinsulin gene fragment PIR to the annealed oligonucleotide. The ligation reaction product was 
electrophoresed on 1.5% agarose gel and the resultant proinsulin gene analog construct (PI Analog) was 
isolated from the gel. 

The isolated proinsulin gene analog sequence (approximately 3 p.moles) was phosphorylated with 
ATP and T4 polynucleotide kinase. 10 ug of plasmid pAT/PIR, prepared as described in (a) above, was 
digested for 1 hour at 37°C with 40 units each of EcoRI and BamHI. Approximately 3 p.moles of the PIR 
proinsulin gene sequence were isolated from agarose gel. The isolated proinsulin gene sequence and 
phosphorlyated proinsulin gene analog sequence were ligated at 37°C for 3 hours in 30 ul of ligation buffer 
containing 20 units of T4 ligase, 40 units of EcoRI and BamHI. 

The ligation mixture was run on 1.0% agarose gel and, after autoradiography, the dimer, i.e. two 
proinsulin coding sequences, and trirner, i.e. three proinsulin coding sequences, were isolated from the gel 
by phenol extraction. The tetramer, pentamer and hexamer, i.e. four, five and six proinsulin coding 
sequences respectively, were obtained by further ligating the isolated fragments with additional proinsulin 
gene analogs following the general procedure described above. 

(c) Preparation of pTac and plac 239 recombinants containing multiple PI units 

The various Pi fragment constructs, containing from two proinsulin coding sequences up to six 
proinsulin coding sequences, were inserted into the vectors p7ac and plac 239 by ligation, as generally 
described in (b) above. The pTac and plac 239 vectors were formed by digestion with EcoRI and BamHI of 
pTac/PI and plac 239/PI respectively, which, in turn, were formed following the procedures which are 
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outlined below in sections (d) and (e). By inserting a proinsulin gene trimer into the plac 239 vector and a 
proinsulin gene pentamer into the pTac vector, there were obtained the fused and unfused multiple 
proinsulin gene plasmids plac 239/3PI and pTacJBPl respectively. The derivation and main restriction site 
map for the latter new plasmid appears as Figure 3. 
5 In similar manner, other multiple proinsulin gene plasmids, including plac 239/4PI, plac 239/5PI, p7ac/ 
3PI, p7ac/4PI, and p7ac/6Pl were constructed from the various multimer proinsulin genes and the plac 239 
and pTac vectors. The various constructs, containing from 1 to 6 copies of the proinsulin coding sequence, 
were confirmed by restriction mapping and DNA sequence analysis. 

10 (d) Preparation of plac 239-PI 

The derivation of plasmid plac 239-PI is illustrated in Figure 6 and was prepared from plasmid plac 504/ 
PI, as shown in detail in Figure 8. The formation of plasmid plac 504/Pi is described in more detail below in 
section (f). The plasmid plac 504/PI has a lac promoter, the p-galactosidase gene {z-gene) complete to the 
EcoRI site near the carboxy terminal of the gene with the human proinsulin gene insert of plasmid pBCA4 

is fused at this EcoRI site, so as to remain in reading frame, such that the translation of the z-gene continues 
directly to code for proinsulin. 

Referring to Figure 8, plasmid plac 504/PI was digested with Mstil and the resulting sticky ends were 
filled in with dTTP and dGTP. The 5'-T's were then removed by digestion with mung bean nuclease, the 
fragment cut with EcoRI to remove the Mst Il/EcoRI segment of the z-gene, the sticky ends of EcoRI were 

20 filled in with dATP and dTTP, and the plasmid was blunt-end ligated to establish a new EcoRI site, the 
resulting plasmid being designated as plac 239/PL The plasmid was cut with EcoRI at the new site, the 
fragment EcoRI tails were digested with mung bean nuclease, and the resulting fragment blunt-end ligated 
to form the plasmid plac 239-PI in which the correct PI reading frame was restored. In order to restore the 
correct PI reading frame in other fused plasmids, for example, plac 239/3PI, the EcoRI tails should also be 

25 removed by the same procedure described above, to form the corresponding expression plasmid plac 239- 
3PI. The derivation and main restriction site map for p!ac-3PI appears as Figure 2. 

(e) Preparation of pTac/PI 

The preparation of plasmid pTac/PI is illustrated in Figure 6 and was effected from plac 504/PI and from 
30 plasmid pDR540. The plasmid pDR540 is described by Russell et ai in Gene, Vol. 20, 231 (1982) and has the 
following features: a Hindill site upstream from the promoter, a -35 region from ftp,' the -10 operator, 
Shine-Delgarno, and Shine-Delgamo-to-start spacing regions of lac, with a BamHI site positioned at the 
start. The plasmid pDR540, purchased from P-L Biochemicals, Milwaukee, Wisconsin, U.S.A., was cut with 
BamHI and Hindill, the sticky ends removed with mung bean nuclease and the isolated Tac promoter was 
35 inserted by blunt-end ligation into the isolated Hindlll-EcoRl fragment of plasmid plac 504/PI which contains 
the PI gene, by first digesting plasmid plac/Pi with Hindill and EcoRI and then filling in the ends with DNA 
polymerase L In this way, the Tac promoter of the plasmid pDR540 is substituted for the wild-type lac 
promoter and the z-gene portion of plac 504/PI, thereby preserving the sequence, spacing and properties of 
the hybrid promoter, but with an EcoRI sequence approriately located. 

40 

(f) Preparation of plac 504/PI 

Plasmid plac 504/PI was prepared following the sequence of steps set forth in Figure 6 and the 
derivation and main restriction sites for the plasmid are illustrated in Figure 7. 

The initial step in the formation of plasmid plac 504/PI was the formation of plasmid plac 19 from 
45 known plasmids pMC1396 and M13mp7. The plasmid pMC1396 is described by Casadaban et al in J. Bact, 
Vol. 143, 971, (1980) while the plasmid M13mp7 is described by Messing et al in Nucleic Acids Research, 
Vol. 9, 309 (1981). Plasmid M13mp7 was purchased from BRL Inc., Gaithersburg, MO, U.S.A. and plasmid 
pMC1396 was a gift of M. Casadaban of Cambridge, MA, U.S.A. 

Plasmid pMC1396 was digested with Smal, treated with Bal 31 and ligated with Hindill linkers to form 
so plasmid plac 9. Plasmid pM13mp7 was digested with Avail and ligated to a Hindill linker to form plasmid 
plac 1. Both plasmid plac 9 and plac 1 were digested with Hindill and ligated to form the plac 19 plasmid. 

Plasmid pBCA4, containing the sequence coding for human proinsulin, was digested with EcoRI and 
Sail and the EcoRI — Sail fragment was isolated. Plasmid plac 19 was digested with EcoRI and Sail and the 
resulting vector was ligated with the EcoRI-Sall fragment of plasmid pBCA 4 to form plasmid plac 19/PI. 
65 Plasmid pBGP120 contains the lac promoter and is described in Polisky et al, Proc. Nat Acad. Sci. Vol. 
73, 3900 (1976). This plasmid obtained from Dr. R. Wu of Cornell University, Ithaca, N.Y., U.S.A., was 
digested with EcoRI and Hindill and the Hindlll-EcoRl fragment containing the lac promoter was separated. 
Following digestion of the piac19/PI plasmid with EcoRI and Hindill, the resulting vector was ligated with 
the Hindlll-EcoRl fragment of pfasmic pBGP120 to obtain the plasmid plac 504/PI. 

60 

Example 2 

This Example illustrates the preparation of converted microorganisms from the novel plasmids 
containing multiple proinsulin genes and expression of the polyproinsulin polypeptide. 

65 
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indeed other microorganisms could have been used. ... „ ua/m i of ammcillin YT medium 
density of 0.1. measured at a wavelength of 560 nm. 

(b) Recovery * P^£^ was continued for about 8 to 1 2 hours. The cells were harvested by 

microcentrifuge at 4°C. The centrifuged pellets were dissolved in the sample buffer and analyzea Dy bus 



PAGE. 



shown in Fipore 9. Proteins, partially purified as f^^^S^SS^SSl ft. pTeS 
IJfflT^P^ W^ttJ^IKS^ pro-ein moileoi.r »ei S n, 

""Si*-* enaivsj of 

232353252^ 

known to be typically obtained when expressing hybrid polypept.des in the pnor art. 

^TOs Example illustrates the isolation and purification of polypeptide fractions and the formation of 
insulin therefrom. 

60 containing 6 M guamdinium HCI and O.z M e ™ ano, ° m ' n * . J£ ' . '' h reac tj 0 n to occur for 2 hours at room 
to concentrations of 0.4 M and 0.8 M respect.vely ; and aHowmg he «~^0 mSJammonium acetate (pH 
temperature. The sulfitolyzed produ ct was prec-P-tated by dtartyj u agamst ™ mM. ^ 

Kets^ s the ratio of 200 ms per 25x10 cm 
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column. The products were differentially eluted by sodium chloride during application of a 0 to 500 mM 
linear NaCI gradient 

The sulfitolyzed biosynthetic human proinsulin (BHPI-SS0 3 ) was readily identifiable by HPLC-analysis 
of the salt gradient elution profile. BHPI-SS0 3 was collected and dialyzed at pH 4.5 to remove formamide 

s and to precipitate the product The pure BHPl-SS0 3 precipitate was dissolved in RH buffer, containing 50 
mM glycine and 5 mM EDTA (pH 10.5), at 0.5 mg/ml and cooled to -2°C. Ethanethiol was added to 800 urn 
and allowed to react for 24 hours to effect refolding of the molecule. The reaction was stopped by adjusting 
the pH to 7 with HCI and the reaction product was analyzed. Refolded biosynthetic human proinsulin (BHPS) 
constituted the monomeric component of the reaction product 

10 The BHPI was dissolved in TC buffer, containing 100 mM Tris-HCl (pH 7.6) and 10 mM CaCI 2 , at 2 mg/ 
ml, 10 pg/ml of trypsin and 40 ug/ml of carboxypeptidase B were added, and the mixture was allowed to 
react for 30 minutes at room temperature. The reaction was stopped by adding acetic acid to 1% and 
biosynthetic human insulin (BHI) was recovered by precipitation. The recovered yield was 98% of 
theoretical, based on the amount of BHPI used. 

is The BHI was considered authentic on the basis of HPLC analysis, amino acid composition, amino acid 
sequencing and biological activity. Human proinsulin in the form of tandemly-linked units, therefore, is 
stably expressed from transformed cells and human insulin can be readily formed from the expressed 
product 

In summary of this disclosure, the present invention provides a procedure for expressing polypeptides 
20 from organisms, which involves providing a DNA fragment containing a multiple number of 
oligonucleotide sequences each coding for a desired polypeptide product and tandemly-linked by 
oligonucleotides coding for easily-cleavable amino acid groupings, forming the fragment into a 
recombinant plasmid and transforming an organism by the plasmid. 

25 Claims 

1. A DNA fragment composed of repeating units coding for a desired product of known amino acid 
sequence tandemly-linked by oligonucleotides coding for a selectively cieavable amino acid sequence and 
having no codons for operative combinations of stop and start signals between the ends of said fragment, 

30 whereby the fragment is expressible in a suitable vehicle to produce a polypeptide composed of tandemly- 
linked repeating units of the said desired product 

2. A fragment as claimed in Claim 1 in a replicable cloning vehicle capable of expressing the fragment 
in a microbial organism. 

3. A fragment as claimed in Claim 1 or Claim 2, wherein said selectiveiy-cleavable amino acid sequence 
35 includes an aspartyl-proline bond so as to be selectiveiy-cleavable at that bond by acids. 

4. A fragment as claimed in Claim 1 or Claim 2 characterised in that the desired product is human 
proinsulin. 

5. A fragment as claimed in any of Claims 1, 2 or 4, characterised by from 2 to 6 repeating units each 
coding for human proinsulin and being tandemly-linked through olignucieotides coding for selectively 

40 cieavable amino acid sequences. 

6. A fragment as claimed in any of Claims 1, 2, 4 or 5, wherein said selectiveiy-cleavable amino acid 
sequence includes a methionine residue so as to be selectively cieavable at that residue by cyanogen 
bromide. 

7. A recombinant plasmid, comprising a plasmid vector and an expressible DNA fragment ligated 
45 thereto, characterised in that the DNA fragment is composed of repeating units coding for a desired 

product of known amino acid sequence tendemly-linked by oligonucleotides coding for a selectively 
cieavable amino acid sequence, and having no codons for operative combinations of stop and start signals 
between the ends of said fragment 

8. A plasmid as claimed in Claim 7 wherein said encoded selectiveiy-cleavable amino acid sequence 
so includes a methionine residue so as to be selectively cieavable at that residue by cyanogen bromide. 

9. A plasmid claimed in Claim 7 characterised in that the desired product is human proinsulin, the DNA 
fragment is joined to the 3'-side of a lac operon fragment encoding the first approximately 80 amino acids 
of P-galactosidase, and the fragment contains from 2 to 5 tandemly-linked oligonucleotide sequences 
coding for human proinsulin. 

55 10. A plasmid as claimed in Claim 7 characterised in that the desired product is human proinsulin, the 
DNA fragment is directly linked to a fragment of the cloning vector containing the Tac promoter followed 
by a bacterial Shine-Dalgarno sequence, and the fragment contains from 3 to 6 tandemly-linked 
oligonucleotide sequences coding for human proinsulin. 

1 1. A plasmid as claimed in any of Claims 7, 9 or 1 0 wherein said encoded selectiveiy-cleavable amino 
60 acid sequence includes a methionine residue so as to be selectively cieavable at that residue by cyanogen 

bromide. 

12. A microorganism which has been transformed by the plasmid claimed in any one of Claims 7 to 11 
for expression of said DNA fragment therefrom. 

13. A method of producing polypeptides by microorganisms, characterised by: 
65 (a) cleaving a vector at a predetermined restriction site; 
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lb) Dreoarinq an insert for the cleaved vector including a DNA fragment which has multiple copies of 
th» Sn?S 52 Ddvoeotide separated by oligonucleotide sequences coding for selectively cleavab le 
a^inTaci?seque P nces! a'nd whfch has no'codlns for operative combinations of stop and start s.gnals 

, ^Xt^^SU* vector to form a recombinant plasmid containing the insert; 
(6) transformina a microorganism with the recombinant plasmid. : mi . , 

(ei ^ gJowlnTsaid I microorganism in a culture medium to effect expression of a DNA fragment 
containina said multiple copies of the coding sequence for the polypeptide; 
^isolating from the microorganism a fraction constituting a polymer composed of the mulbple linked 

w conies of the polypeptide; and 

(□> cleavina the isolated multiple copy polypeptide to single polypeptides. 

14 A mThoJ as claimed in Ciaim 13 characterised in that step (f) includes sonic disruption of cells of 

thB t^td^^^^ - '.aimed <" «*« 

rs by growing ?aW mTcroorganismin a'culture medium to effect expression of a DNA fragment containing 
sid m3e copies of the coding sequence for the polypeptide and to produce a polymer composed of 
multiole inked copies of the polypeptide; and cleaving the polymer to produce said polypeptide. 

16 A So" i « f claimed in any one of Claims 13 to 15 characterised in that the polypeptide is human 

Pr0i "7 Ul A method as claimed in Claim 16, characterised in that the human proinsulin is converted into 
human insulin. 
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Patentanspruche 



1 DNA-Fragment, das aus wiederholten Einheiten, die fur ein gewOnschtes Produkt mit bekannter 
AminosyuVesTq& 

fetwobei die OHqonukleotide fur eine selektiv spaltbare Aminosauresequenz kodieren und kerne Kodons 
£ w5same 9 Kombination von Stopp- und Startsignalen zwischen den Enden des Fragmentes ha b _en 
wodurch das Fragment in einem geeigneten Vehikel exprimierbar ist urn e.n Polypep *d * ^J^das 
30 Tus Tandem-artig verbundenen wiederholten Einheiten des gewiinschten Produktes zusammengesetzt .st 
30 aus Tandem^ w y ^ ^ re p| iz ierbaren Klonierungsvehikel, das zur Expression des 

Fragmentes in einem mikrobiellen Organismus fahig ist. e .. wiu c _ a i tha r« 

3 Fraament nach Anspruch 1 oder 2, dadurch gekennzeichnet, daB die selektiv spaltbare 
^LS^^nB ^^ro^n^ einschlieBt, urn an dieser Bindung durch Sauren selektrv 

35 SP3lt 4 b FragmeS nach Anspruch 1 Oder 2, dadurch gekennzeichnet, daB das gewunschte Produkt humanes 

Pr ° in 5 S Fraament nach einem der Anspruche 1, 2 oder 4, gekennzeichnet durch zwei bis sechs wiederholte 
Einheiten 'r jeSs T humane? Proinsulin kodieren und Tandem-artig durch OI.gonukleot.de 
verbunden sind, die fur selektiv spaltbare Aminosauresequenzen kodieren. M , oW iw 

e^Sgmen nach einem der Anspruche 1, 2, 4 oder 5, dadurch 9 ek ™° hn R ^ 
spaltbare Aminosauresequenz einen Methioninrest umfaBt, urn so an d.esem Rest selektrv durch 

Br0 rReko S mWnantVs S ptsmid, umfassend einen Plasmidvektor und ein damit ligiertes exprimierbares 
DNA-Fragmem dadurch gekennzeichnet, daB das DNA-Fragment aus wiederholten Emhe en 
^;«mmpno P setzt ist die fGr ein gewunschtes Produkt bekannter Aminosauresequenz kodieren und 
tSSSGSS^ O.iSSnuW^de^rbunden sind, wobei die Oligonukleotide fur eine selekt^spaltbare 
AminoTaurelequenz kodieren und keine Kodons fur eine wirksame Komb.nat.on von Stopp- und 

Startsignalen zwischen den Enden des Fragmentes haben. Wo di ert e selektiv soaltbare 

8 Plasmid nach Anspruch 7, dadurch gekennzeichnet, daB die kod.erte seiewiv s P* n ™™ 
AmiLa" reTeque4 einen Methioninrest umfaBt, urn so an diesem Rest selektiv durch Bromcyan spaltbar 

2U S 9 in Plasmid nach Anspruch 7, dadurch gekennzeichnet. daB das gewunschte Produkt humanes 
ProinsuMn iS das DNA Fragment mit dem 3'-Ende eines Lac-Operon-Fragmentes verbunden ist, das d e 
e£e ^ S fAT>SL der B-Ga«actosidase kodiert. und daB das Fragment 2-5 Tandem-artig 
verbundene Oligonukleotidsequenzen, die fur humanes Proinsulin kodieren, enthalt 

10 PlasmS nach Anspruch 7, dadurch gekennzeichnet. daB das gewunschte : Produkt humanes 
Proinsulin ist das DNA-Fragment direkt an ein Fragment des Klonierungsvektors gebunden is ^ das den 
Tac Promoio enthalt, gefolgt von einer bakteriellen Shine-Dalgarno-Sequenz, und daB das Fragment 3-6 
irndem artg ve bundle Oligonukleotidsequenzen enthalt, die fur humanes Promsuhn kodieren. 

1 1 pSmid iach einem der Anspruche 7. 9 oder 10, dadurch gekennzeichnet, daB die kod.erte selektiv 
spaltbare ^ IminSesequenz einen Methioninrest umfaBt, urn so an d.esem Rest selektrv durch 

Br0 TMi kr^fg^smutder zur Expression des DNA-Fragmentes mit dem Plasmid nach einem der 
65 Anspruche 7—1 1 transformiert ist. 
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13. Verfahren zum Herstellung von Polypeptiden durch Mikroorganismen, gekennzeichnet durch: 

(a) Spalten eines Vektors an einer vorbestimmten Restriktionsschnittstelle; 

(b) Herstellen eines Inserts fur den gespaltenen Vektor einschlielSlich eines DNA-Fragmentes, das 
multiple Kopien des Genes fur das Polypeptid hat, die durch Oligonukleotidsequenzen getrennt sind, die 

5 fur selektiv spaltbare Aminosauresquenzen kodieren, und keine Kodons fur die wirksame Verbindung von 
Stopp- und Startsignalen zwischen den Enden des Fragmentes aufweist; 

(c) Lfgieren des Inserts mit dem gespaltenen Vektor, um ein das Insert enthaltendes rekombinsntes 
Plasmid zu bilden; 

(d) Transformieren eines Mikroorganismus mit dem rekombinanten Plasmid; 

w (e) Wachsenlassen des Mikroorganismus in einem Kuiturmedium, um die Expression eines DNA- 
Fragmentes, das die multiplen Kopien der kodierenden Sequenz fur das Polypeptid enthalt, zu bewirken; 

(f) Isolieren einer Fraktion aus dem Mikroorganismus, die das Polymer konstitutiert und aus multiplen 
verbundenen Kopien des Polypeptides zusammengesetzt ist; und 

(g) Spalten des isolierten Polypeptids mit multiplen Kopien zu einzelnen Polypeptiden. 

15 14. Verfahren nach Anspruch 13, dadurch gekennzeichnet, daS der Schritt (f) ein Aufbrechen der Zelien 
durch Beschallen des aus dem Kuiturmedium abgetrennten Mikroorganismus umfafct. 

15. Verfahren zum Herstellen von Polypeptiden aus einem Mikroorganismus nach Anspruch 12, 
gekennzeichnet durch Wachsenlassen des Mikroorganismus in einem Kuiturmedium, um die Expression 
eines mehrere Kopien der fur das Polypeptid kodierenden Sequenz enthaltenden DNA-Fragmentes zu 

20 bewirken und ein Polymer zu erzeugen, das aus multiplen verbundenen Kopien des Polypeptids besteht; 
und Spalten des Polymers, um das Polypeptid zu erzeugen. 

16. Verfahren nach einem der Anspruche 13 bis 15, dadurch gekennzeichnet, dafc das Polypeptid 
humanes Proinsulin ist. 

17. Verfahren nach Anspruch 16, dadurch gekennzeichnet, dafc das humane Proinsulin in humanes 
25 Insulin umgewandelt wird. 

Revendications 

1. Fragment d'ADN compose de motifs codant pour un produit desirS de sequence d'acides amines 
30 connue lies en tandem par des oligonucleotides codant pour une sequence d'acides amines clivabie 

selectivement et n'ayant pas de codons pour des combinaisons operatoires de signaux d'arret et de depart 
entre ies extremites dudit fragment, ce qui fait que le fragment peut etre exprirnS dans un vehicule 
approprie pour produire un polypeptide compose de motifs lies en tandem dudit produit desire. 

2. Fragment selon la revendication 1 dans un vehicule de colonage replicable capable d'exprimer le 
35 fragment dans un organisme microbien. 

3. Fragment selon la revendication 1 ou la revendication 2, dans lequel ladite sequence d'acides 
amines clivabie selectivement comprend une liaison sspartylproline de maniere a etre selectivement 
clivabie a cette liaison par des acides. 

4. Fragment selon la revendication 1 ou la revendication 2, caracteris6 en ce que le produit desir6 est la 
40 proinsuline humaine. 

5. Fragment selon Tune quelconque des revendications 1, 2, ou 4, caracterise par de 2 a 6 motifs codant 
chacun pour la proinsuline humaine et lies en tandem par des oligonucleotides codant pour des sequences 
d'acides amines selectivement clivables. 

6. Fragment selon Tune quelconque des revendications 1, 2, 4 ou 5, dans lequel ladite sequence 
45 d'acides amines selectivement clivabie comprend un reste methionine de maniere a etre selectivement 

clivabie a ce reste par du bromure de cyanogene. 

7. Plasmide recombinant comprenant un vecteur plasmidique et un fragment d'ADN exprimable qui y 
est Ite, caracterise en ce que le fragment d'ADN est compose de motifs codant pour un produit desire de 
sequence d'acides amines connu lies en tandem par des oligonucleotides codant pour une sequence 

so d'acides amines selectivement clivabie, et n'ayant pas de codons pour des combinaisons operatoires de 
signaux d'arret et de depart entre Ies extremites dudit fragment. 

8. Plasmide selon la revendication 7 dans lequel ladite sequence d'acides amines selectivement 
clivabie encodee comprend un reste methionine de maniere a etre selectivement clivabie a ce reste par du 
brumure de cyanogene. 

55 9. Plasmide selon la revendication 7 caracterise en ce que le produit desire est la proinsuline humaine, 
en ce que le fragment d'ADN est reuni au cdte 3' d'un fragment d'operon lac encodant Ies environ 80 
premiers acides amines de la p-galactosidase, et le fragment contient de 2 a 5 sequences 
d'oligonucleotides codant pour la proinsuline humaine liees en tandem. 

10. Plasmide selon la revendication 7 caracterise en ce que le produit desire est la proinsuline humaine, 
50- le fragment d'ADN est directement lie a un fragment du vecteur de clonage contenant le promoteurTac 

suivi par une sequence bacterienne de Shine-Dalgarno, et le fragment contient de 3 a 6 sequences 
d'oligonucleotides codant pour la proinsuline humaine liees en tandem. 

11. Plasmide selon I'une quelconque des revendications 7, 9 ou 10 dans lequel ladite sequence d'acides 
amines selectivement clivabie encoded comprend un reste methionine de maniere a etre selectivement 

65 clivabie a ce reste par du bromure de cyanogene. 
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12. Microorganisms qui a ete" transforme par Is plasmide revendique dans I'une quelconque des 
revendications 7 a 11 pour ('expression dudit fragment d'ADN. 

13. Procede de production de polypeptides par des microorganismes, caracterise en ce que: 
(a) un dive un vecteur en un site de restriction predetermine; 

5 (b) on prepare un insert pour le vecteur dive incluant un fragment d'ADN qui a des exemplaires 
multiples du gene pour ie polypeptide separe par des sequences ^oligonucleotides codant pour des 
sequences d'acides amines seiectivement clivabtes, et qui n'a pas de codons pour des combinaisons 
operatoires de signaux d'arret et de depart entre les extremites dudit fragment; 

(c) on lie I'insert au vecteur clive pour former un plasmide recombinant contenant I'insert; 

w (d) on transforme un microorganisme avec ie plasmide recombinant; 

(e) on cultive iedit microorganisme dans un milieu de culture pour effectuer ['expression d'un fragment 
d'ADN contenant lesdits exemplaires multiples de ia sequence de codage pour le polypeptide; 

(f) on isole du microorganisme une fraction constituant un polymere compose de multiples 
. exemplaires lies au polypeptide; et 

is (g) on dive le polypeptide a exemplaires multiples isole en polypeptides uniques. 

14. Procede selon la revendication 13, caracterise en ce que I'etape {f) comprend une rupture par 
ultrasons des cellules du microorganisme separees du mileu de culture. 

15. Procede de production de polypeptides a partir d'un microorganisme selon la revendication 12, 
caracterise en ce qu'on cultive iedit microorganisme dans un milieu de culture pour effectuer I'expression 

20 d'un fragment d'ADN contenant lesdits exemplaires multiples de la sequence de codage pour le 
polypeptide et pour produire un polymere compose de multiples exemplaires lies au polypeptide; et en ce 
qu'on clive le polymere pour produire Iedit polypeptide. 

16. Procede selon I'une quelconque des revendications 13 a 15 caracterise en ce que le polypeptide est 
ia proinsuline humaine. 

25 17. Procede selon ia revendication 16, caracterise en ce que ia proinsuline humaine esttransformee en 
insuline humaine. 
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